Human Intent Prediction Using Markov Decision Processes
نویسندگان
چکیده
This paper describes a system for modeling human task-level intent through the use of Markov Decision Processes (MDPs). To maintain safety and efficiency during physicallyproximal human-robot collaboration, it is necessary for both human and robot to communicate or otherwise deconflict physical actions. Human-state aware robot intelligence is necessary to facilitate this. However, physical action deconfliction without explicit communication requires a robot to estimate a human (or robotic) companion’s current action(s) and goal priorities, and then use this information to predict their intended future action sequence. Models tailored to a particular human can also enable online human intent prediction. We call the former a ‘simulated human’ model – one that is non-specific and generalized to statistical norms of human reaction obtained from human subject testing. The latter we call a ‘human matching’ model – one that attempts to produce the same output as a particular human subject, requiring online learning for improved accuracy. We propose the creation of ‘simulated human’ and ‘human matching’ models in this manuscript as a means for a robot to intelligently predict a human companion’s intended future actions. We develop a Human Intent Prediction (HIP) system, which can model human choice, to satisfy these needs. This system, when given a time history of previous actions as input, predicts the most likely action a human agent will next make to a robot’s task scheduling system. Our HIP system is applied to an intra-vehicle activity (IVA) space robotics application. We use data from preliminary human subject testing to formulate and populate our models in an offline learning process that illustrates how the models can adapt to better predict intent as new training data is incorporated.
منابع مشابه
Accelerated decomposition techniques for large discounted Markov decision processes
Many hierarchical techniques to solve large Markov decision processes (MDPs) are based on the partition of the state space into strongly connected components (SCCs) that can be classified into some levels. In each level, smaller problems named restricted MDPs are solved, and then these partial solutions are combined to obtain the global solution. In this paper, we first propose a novel algorith...
متن کاملIntent-aware Multi-agent Reinforcement Learning
This paper proposes an intent-aware multi-agent planning framework as well as a learning algorithm. Under this framework, an agent plans in the goal space to maximize the expected utility. The planning process takes the belief of other agents’ intents into consideration. Instead of formulating the learning problem as a partially observable Markov decision process (POMDP), we propose a simple bu...
متن کاملTowards Guaranteeing Safe and Efficient Human-Robot Collaboration Using Human Intent Prediction
This paper describes an autonomous framework for determining a robotic manipulator’s optimal actions in real-time when interacting in close physical proximity to a human in a shared workspace environment. This framework allows the robot to purposefully choose to avoid physical and mental conflicts with a human companion while each agent performs tasks to complete their respective, separately-as...
متن کاملپیش بینی بیماریهای کبدی با استفاده از مدل مارکف پنهان
Background: The liver is the largest internal organ and the most important organ after heart and brain in the human body without which life is impossible. Diagnosis of liver disease requires a long time and sufficient expertise of the doctor. Statistical methods can be classified as an automated forecasting system and help specialists for quickly and accurately diagnose liver disease. Hidden Ma...
متن کاملSimulation and prediction of land use and land cover change using GIS, remote sensing and CA-Markov model
This study analyzes the characteristics of land use/land cover change in Jordan’s Irbid governorate, 1984–2018, and predicts future land use/land cover for 2030 and 2050 using a cellular automata-Markov model. The results inform planners and decision makers of past and current spatial dynamics of land use/land cover change and predicted urban expansion, for a better understanding and successful...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Aerospace Inf. Sys.
دوره 12 شماره
صفحات -
تاریخ انتشار 2012